Phonetic segmentation of speech signal using local singularity analysis
نویسندگان
چکیده
This paper presents the application of a radically novel approach, called the Microcanonical Multiscale Formalism (MMF) to speech analysis. MMF is based on precise estimation of local scaling parameters that describe the inter-scale correlations at each point in the signal domain and provides efficient means for studying local non-linear dynamics of complex signals. In this paper we introduce an efficient way for estimation of these parameters and then, we show that they convey relevant information about local dynamics of the speech signal that can be used for the task of phonetic segmentation. We thus develop a two-stage segmentation algorithm: for the first step, we introduce a new dynamic programming technique to efficiently generate an initial list of phoneme-boundary candidates and in the second step, we use hypothesis testing to refine the initial list of candidates. We present extensive experiments on the full TIMIT database. The results show that our algorithm is significantly more accurate than state-of-the-art ones.
منابع مشابه
A novel approach to phonetic segmentation through local singularity analysis of speech
This paper presents the application of a radically novel approach, called the Microcanonical Multiscale Formalism (MMF) to speech analysis. MMF provides efficient means for studying local non-linear dynamics of complex signals. The formalism is based on precise estimation of local scaling parameters that describe the inter-scale correlations at each point in the signal domain. In this paper we ...
متن کاملA novel text-independent phonetic segmentation algorithm based on the microcanonical multiscale formalism
We propose a radically novel approach to analyze speech signals from a statistical physics perspective. Our approach is based on a new framework, the Microcanonical Multiscale Formalism (MMF), which is based on the computation of singularity exponents, defined at each point in the signal domain. The latter allows nonlinear analysis of complex dynamics and, particularly, characterizes the interm...
متن کاملSpeech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers
In spite of decades of research, Automatic Speech Recognition (ASR) is far from reaching the goal of performance close to Human Speech Recognition (HSR). One of the reasons for unsatisfactory performance of the state-of-the-art ASR systems, that are based largely on Hidden Markov Models (HMMs), is the inferior acoustic modeling of low level or phonetic level linguistic information in the speech...
متن کاملA Novel Method for Arabic Consonant/Vowel Segmentation Using Wavelet Transform
Automatic speech segmentation is a key step for building large vocabulary and continuous speech recognition systems. An alternative method is to use manual speech segmentation, which is tedious, time consuming, subjective and error prone. Many automatic speech segmentation methods have been proposed based on linguistic information such as phonetic transcription (phonetic string) but for real ti...
متن کاملAutomatic Phonetic Segmentation for a Speech Corpus of Hebrew
This paper presents our study on different phonetic segmentation methods based on hidden Markov models evaluated against a Hebrew speech corpus. We investigated methods for fully automatic phonetic segmentation using only the corpus which should be segmented and automatically generated phonetic transcriptions. A new method for phonetic boundary correction based on spectral variation of the spee...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Digital Signal Processing
دوره 35 شماره
صفحات -
تاریخ انتشار 2014